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Abstract 

Using Wang-Landau sampling with suitable Monte Carlo trial moves (pull moves and bond-rebridging 
moves combined) we have determined the density of states and thermodynamic properties for a short se- 
quence of the HP protein model. For free chains these proteins are known to first undergo a collapse 
"transition" to a globule state followed by a second "transition" into a native state. When placed in the 
proximity of an attractive surface, there is a competition between surface adsorption and folding that leads 
to an intriguing sequence of "transitions" . These transitions depend upon the relative interaction strengths 
and are largely inaccessible to "standard" Monte Carlo methods. 
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1. Introduction 



2. Model and Methods 



Over the last few decades, considerable effort 
has been devoted to the protein folding problem 
and yet major challenges remain. The complex- 
ity arises from enormous combinatorics of the 20 
amino acids, forming multitudinous possible pro- 
tein sequences to which the structure is closely re- 
lated, and hence its biological function [1]. Com- 
phcated interactions among residues also result in 
rough free energy landscapes [H. Nevertheless, the 
underlying principles for protein structure predic- 
tion from the sequence are still unclear [3]. On the 
other hand, technical limitations also hinder exper- 
imental progress in obtaining protein structures us- 
ing x-ray diffraction or NMR. Computer simulation 
can shed light on the problem where both theoreti- 
cal and experimental studies are facing bottlenecks. 
Similar to the importance of the Ising model in the 
study of magnetism, simple protein models that em- 
brace merely essential features are amenable to at- 
tack by a wide range of numerical methods, includ- 
ing computer simulations. This also allows under- 
standing the folding problem from a macroscopic 
perspective without distraction from unnecessary 
details 11. 



The HP model is a simple, prototypical lattice 
protein model. It consists of only two types of 
monomers, hydrophobic (H) and polar (P), in a 
sequence chosen to mimic a real protein [5]. In- 
teractions are restricted to an attractive coupling, 
chh^ between non-bonded hydrophobic monomers 
occupying nearest-neighbor sites. These are intro- 
duced to capture the hydrophobic effect, which is 
considered as the "driving force" of protein folding 
in forming tertiary structures. Despite its simplic- 
ity, the HP model shows unexpectedly rich ther- 
modynamic behaviors and is challenging to study. 
Finding the ground state of an HP sequence is itself 
an NP-hard problem [gI, 0] • 

It should be pointed out that since the HP chain 
is constructed specifically to represent an individ- 
ual protein, thermodynamic and structural prop- 
erties of the chain depend on its chain length and 
the sequence of H and P monomers uniquely. In 
this case, there does not exist finite size scaling for 
this model, and thus there are no real phases and 
transitions in the thermodynamic limit. 

To understand how a protein interacts with an at- 
tractive substrate, a model surface which attracts 
both types of monomers with strength is intro- 
duced to the 3-dimensional lattice as an x^-plane 
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placed at 2: = 0. A second, non- attractive wall 
is placed at z = to bound the HP chain from 
above. Periodic boundaries are used for the x and 
y directions. The Hamiltonian of the model is then 
represented as: 



(1) 



uhh being the number of H-H interacting pairs and 
Us being the number of monomers adjacent to the 
bottom surface. Such a system has first been intro- 
duced and studied in Ref. [8]. 

We have implemented two inventive Monte Carlo 
trial moves, namely, pull moves ^ and bond- 
rebridging moves (lOj, which are efficient both in 



compact and elongated configurations Em- 
ploying Wang- Landau sampling [l2| we have ob- 
tained the density of states in energy g{E)^ from 
which the partition function Z can be calculated: 



(2) 



where k is the Boltzmann constant, T is temper- 
ature and the sum runs over all possible energies. 
The average energy {E) then follows: 



{E) = ^Y.^^9{E^)e 



-Ei/kT 



(3) 



and the specific heat can be calculated from the 
fluctuation of energy: 



' {{E')-{Ef). 



(4) 



In Wang-Landau sampling, g{E) is estimated it- 
eratively. During the simulation, a multiplicative 
modification factor / is used to update g{E) with 
an initial value finu = e^. A histogram in energy 
H{E) is accumulated besides g{E). When a "flat" 
histogram is attained, the simulation is brought to 
the next iteration: H{E) is reset and / is reduced, 
fnew = Vf- A histogram is said to be "flat" when 
all entries in H[E) are greater than p x Have-, where 
p is the flatness criterion and Have is the average of 
all entries in H{E). To yield reliable estimates for 
g{E)^ all results presented in this work are obtained 
by using a flatness criterion p = 0.8. The simulation 
stops when the natural log of the final modification 
factor, ln{f final) 1 reaches a preset minimum value 
of 10-^ 



3. Results 

As a first step, we have simulated an HP se- 
quence with 36 beads (PPPHHPPHHPPPPPHH- 
HHHHHPPHHPPPPHHPPHPP), which has ffist 
been studied in Ref. [l3|. In free space, the HP 
chain undergoes a coil-globule transition as temper- 
ature decreases, resulting in a compact hydrophobic 
core with polar residues residing on the exterior to 
screen the core from the polar solvents. See Figure 
[H Similar behaviors have also been observed for 
other lengths and sequences 13, [lH, 
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Figure 1: (Color online.) Ground state of the 36mer in free 
space {E = —18, uhh = 18)- The HP chain is displayed 
with larger polar and smaller hydrophobic residues. 

With a surface that attracts both H and P 
monomers, the HP chain exhibits a much richer 
"transition" hierarchy due to the competition be- 
tween surface adsorption and attraction within the 
polymer. To illustrate the effects of the surface 
strength on the "transition" behaviors, both a weak 
surface {eHH = 12, = 1) and a strong surface 



1) are considered. 15 independent 



runs were performed to obtain statistical errors in 
each case. CPU time grows with the size of the en- 
ergy range of the systems: it takes about 15 minutes 
to finish a simulation on an AMD Opteron dual-core 
2.2 GHz processor for the surface- free case using 19 
energy bins, 5-10 hours for the strong attractive 
surface case with 51 energy bins, but more than 10 
days are generally needed for the weak attractive 
surface where there are 242 energy bins in the full 
energy range. 

Figure [2] shows a typical specific heat of an HP 
chain in three-dimensional free space, as well as that 
when it interacts with a weakly attractive surface. 
In the latter case the height of the non-attractive 
wall is set to he = N -\- 1 = 37, i.e., there are 
36 layers between the two horizontal surfaces, the 
36mer can touch both surfaces with its ends only 
when it is a fully stretched, vertical chain. While 
there is only a single peak corresponding to the col- 
lapse "transition" at T/chh ^ 0.5 for a free chain. 
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three peaks are observed in the case with a weakly 
attractive surface. From this comparison it is ob- 
vious that the two peaks in the low temperature 
regime are due to the influence of the attractive 
substrate. 




Figure 2: Specific heat of a HP 36mer: (i) in a 3D free space, 
(ii) interacting with a weakly attractive surface (chh = 
12, Cs = 1, hw = 37). Error bars are smaller than the data 
points. 

The largest peak which overlaps with the free 
chain's specific heat represents the same coil- 
globule transition: during this stage, the HP poly- 
mer transforms from an extended coil-like structure 
to a desorbed, but compact, globule. Typical re- 
sulting structures are principally the same as the 
ground states of the free chain in the absence of the 
surface, as shown in Figure [3l 

The middle peak at T/chh ^ 0.18 signals ad- 
sorption during which the compact HP globule 
"docks" at the surface with the hydrophobic core 
remaining intact. The conformation spans several 
layers vertically and the total energy of the system 
is lowered slightly due to contact with the surface, 
by an amount dependent upon the number of sur- 
face contacts. 

Further decrease in temperature brings the sys- 
tem to the third transition at T/chh ^ 0.05 where 
it strives for a maximum number of surface inter- 
actions without sacrificing an intact, energetically 
minimized hydrophobic core. With a large value of 
^HH^ forming H-H contacts is immensely more en- 
ergetically favorable than forming surface contacts. 

A closer look at the low temperature regime re- 
veals a subtle shoulder at T/enH ^ 0.02, which 
has not been found in Ref. [13|- This shoulder is 
a crossover signal owing to a transition from the 



ground state to the first few excited states at low 
temperature, that is similar to the case reported 
in the investigation of freezing and collapse of ho- 
mopolymers [ll|, ll7| . In our system, this excitation 
is an effect purely due to the existence of the sur- 
face, which is supported by the analyses on other 
structural parameters (not presented here) and also 
by observing the typical states at T/chh = and 
T/cHH ^ 0.02. At T/cHH = where the polymer 
occupies just ground states, only rectangular cores 
are able to maximize the number of surface con- 
tacts. This gives the ground state energy E = —241 
{nHH = 18, = 25). An example of such a struc- 
ture is shown in the bottom left-hand corner of Fig- 
ure [3l At T/chh ^ 0.02 where first and second ex- 
cited states dominate, two core shapes are observed: 
rectangular and "L-shape" as shown in Figure IH 
Since these two cores have the same energetic con- 
tribution {riHH = 18, with 8 H monomers interact- 
ing with the surface for both cases), the excitation 
from the ground state to the first few excited states 
can only be due to the decrease in surface contacts 
of the P monomers. 




Figure 4: (Color online.) First excited states of the 36mer in- 
teracting with a weak attractive surface, E = —240 (uhh = 
18, Us = 24). The HP chains are displayed with larger polar 
and smaller hydrophobic residues. 

On the other hand, when the HP chain is brought 
near a strong surface field, the transition behavior 
is quite different from the previous case. We ob- 
serve only two peaks in the specific heat, see Fig- 
ure [H The one at higher temperature corresponds 
to an adsorption transition, and the other one, at 
T/chh ^ 0.3, corresponds to the collapse transi- 
tion that takes place on the surface. At high tem- 
perature, the extended HP chain first attaches to 
the surface until most monomers touch it. This ad- 
sorbed, yet expanded structure then undergoes a 
second transition to maximize the number of H- 
H contacts and finally achieves a film-like, two- 
dimensional compact structure. 

Unlike the previous case with a weakly attractive 
surface, there is no shoulder in the very low temper- 
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Figure 3: (Color online.) Specific heat of an HP 36mer interacting with a weak attractive surface (chh = 12, Cs = 1), 
hw = 37. Error bars smaller than the data points are not shown. Typical structures are displayed with larger polar and smaller 
hydrophobic residues. 



ature regime. This is because the excitation from 
the ground state {E = —^O^uhh = 14, = 36, 
as shown in the bottom left-hand corner of Fig- 
ure [5|) to the first excited states {E = —4:9^nHH = 
13, Us = 36) in this case is mainly achieved by the 
destruction of the H-H contacts, while the entire 
chain still remains completely attached to the sub- 
strate. Figure [6] shows two examples of first excited 
states having different hydrophobic cores from that 
of the ground state. Therefore, the effect of the ex- 
citation cannot be singled out to give an individual 
signal, but is rather included within the collapse 
transition peak. 

Figure [5] also shows the influence of the entropic 
effects on the transition behavior due to the height 
of the non-interacting wall bounding the chain 
from above. Obviously the low- temper ature col- 
lapse is not affected at all, as it takes place on the 
attractive surface which is not related to the steric 
upper surface. The desorption-adsorption transi- 
tion, however, shows systematic dependence on hw 
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Figure 6: (Color online.) First excited states of the 36mer in- 
teracting with a strong attractive surface, E = —49 (uhh = 
13, Us = 36). The HP chains are displayed with larger polar 
and smaller hydrophobic residues. 



A smaller h^ restricts the vertical movements of the 
HP chain to a larger extent, resulting in a smaller 
entropy gain (and thus a less pronounced peak in 
the specific heat) as less translational variations of 
the same configuration are allowed. The chain is 
also more likely to be on the attractive surface, re- 
sulting in a higher adsorption temperature. This 
dependence of the adsorption transition peak on h^ 
is also reported recently [18], where an off-lattice 
homopolymer model is used to study polymer ad- 



4 




Figure 5: (Color online.) Specific heat of an HP 36mer interacting with a strong attractive surface {enn = l,es = 1). Error 
bars smaller than the data points are not shown. Typical structures are displayed with larger polar and smaller hydrophobic 
residues. 



sorption. 



4. Summary and Outlook 



To summarize, we studied protein folding us- 
ing Wang-Landau sampling with a minimalistic lat- 
tice model, the HP model. Two Monte Carlo 
trial moves, pull moves and bond-rebridging moves, 
which work particularly well with our algorithm, 
were implemented. In this work, we have focused 
on investigating protein adsorption. With a weak 
surface, three-stage folding is identified, while two- 
stage folding is found with a strong surface. Vary- 
ing the height of the upper non-interacting surface 
regulates the total entropy of the system, which af- 
fects the higher temperature "transition". Further 
work in progress includes longer benchmark HP se- 
quences and the investigation of the thermodynam- 
ics of different structural quantities and their rela- 
tionship with different "transitions" . 
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